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^ . Abstract 

3^ ' Motivated by issues of saviiiff energy in data centers we define a collection of new problems referred 

^— ^ ' to as "machine activation" problems. The central framework we introduce considers a collection of 

, m machines (unrelated or related) with each machine i having an activation cost of . There is also 

O ' a collection of n jobs that need to be performed, and pij is the processing time of job j on machine i. 

O ' Standard scheduling models assume that the set of machines is fixed and all machines are available. 

I However, in our setting, we assume that there is an activation cost budget of A - we would like to 

select a subset S of the machines to activate with total cost a{S) < A and find a schedule for the n 
jobs on the machines in S minimizing the makespan (or any other metric). 

We consider both the unrelated machines setting, as well as the setting of scheduling uniformly 
related parallel machines, where machine i has activation cost and speed s^, and the processing 
time of job j on machine i is pij = where pj is the processing requirement of job j. 

For the general unrelated machine activation problem, our main results are that if there is a 
c/3 I schedule with makespan T and activation cost A then we can obtain a schedule with makespan 

O^. (2 + e)T and activation cost 2(1 + 7)(ln -Qppp + ^)A, for any e > 0. We also consider assignment costs 

for jobs as in the generalized assignment problem, and using our framework, provide algorithms that 
minimize the machine activation and the assignment cost simultaneously. In addition, we present 
a greedy algorithm which only works for the basic version and yields a makespan of 2T and an 
activation cost A{1 + Inn). 
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Xi ' For the uniformly related parallel machine scheduling problem, we develop a polynomial time 

ll;' ' approximation scheme that outputs a schedule with the property that the activation cost of the 

' subset of machines is at most A and the makespan is at most (1 + e)T for any e > 0. For the special 

case of TO identical speed machines, the machine activation problem is trivial, since the cheapest 
' subset of k machines is always the best choice if the optimal solution activates k machines. In 

addition, we consider the case when some jobs can be dropped (and are treated as outliers). 

a^ ■ 

^ ' 1 Introduction 

Large scale data centers have emerged as an extremely popular way to store and manage a large 
^ ' volume of data. Most large corporations, such as Google, HP and Amazon have dozens of data centers. 
■ These data centers are typically composed of thousands of machines, and have extremely high energy 
requirements. Data centers are now being used by companies such as Amazon Web Services, to run 
large scale computation tasks for other companies who do not have the resources to create their own 
data centers. This is in addition to their own computing requirements. 

These data centers are designed to be able to handle extremely high work loads in periods of peak 
demand. However, since the workload on these data centers fluctuates over time, we could selectively 
shut down part of the system to save energy when the demand on the system is low. Energy savings 
results not just from putting machines in a sleep state, but also from savings in cooling costs. 

Hamilton (see the recent SIGACT News article [3]) argues that a ten fold reduction in the power 
needs of the data center may be possible if we can simply build systems that are optimized with power 
management as their primary goal. Suggested examples (summarizing from the original text) are: 
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1. Explore ways to simply do less during surge load periods. 

2. Explore ways to migrate work in time. The work load on modern cloud platforms is very cyclical, 
with infrequent peaks and deep valleys. Even valley time is made more expensive by the need to 
own a power supply to be able to handle the peaks, a number of nodes adequate to handle surge 
loads, a network provisioned for worst case demand. 

This leads to the issue of which machines can we shut down, since all machines in a data center are 
not necessarily identical. Each machine stores some data, and is thus not capable of performing every 
single job efficiently unless some data is first migrated to the machine. We will formalize this question 
very shortly. 

To quote from the recent article by Birman et al. (SIGACT News [3]) "Scheduling mechanisms that 
assign tasks to machines, but more broadly, play the role of provisioning the data center as a whole. As 
we'll see below, this aspect of cloud computing is of growing importance because of its organic connection 
to power consumption: both to spin disks, and run machines, but also because active machines produce 
heat and demand cooling. Scheduling, it turns out, comes down to deciding how to spend money." 

Data is replicated on storage systems for both load balancing during peak demand periods, as well 
as for fault tolerance. Typically many jobs have to be scheduled on the machines in the data center. In 
many cases profile information for a set of jobs is available in advance, as well as estimates of cyclical 
workloads. Jobs may be I/O intensive or CPU intensive, in either case, an estimate of its processing 
time on each type of machine is available. Jobs that need to access specific data can be assigned to any 
one of the subset of machines that store the needed data. Our goal is to first select a subset of machines 
to activate, and then schedule the jobs on the active machines. From this aspect our problems differ 
from standard scheduling problems with multiple machines, where the set of active machines is the set 
of all machines. Here we have to decide which machines to activate and then schedule all jobs on the 
active machines. 

The scheduling literature is vast, and one can formulate a variety of interesting questions in this 
model. We initiate this work by focusing our attention on perhaps one of the most widely studied 
machine scheduling problems since it matches the requirements of the application. We have a collection 
of jobs and unrelated machines, and need to decide which subset of machines to activate. The jobs can 
only be scheduled on active machines. This provides an additional dimension for scheduling problems 
that was not previously considered. This situation also makes sense when we have a certain set of 
computational tasks to process, a cost budget, and can purchase access to a set of machines. 

One fundamental (and well studied) scheduling problem is as follows: Given a collection of n jobs, 
and m machines where the processing time of job j on machine i is pij, assign all jobs to machines 
such that the makespan, i.e., the time when all jobs are complete, is minimized. This problem is widely 
referred to as unrelated parallel machine scheduling \n\ I20|. If machine i does not have the data that 
job j needs to run, then we set pij = oo, otherwise the processing time pij is some constant pj which 
only depends on job j. This special case is the so-called restricted scheduling problem and known to 
be A^P-hard. However, if a schedule exists with makespan T, then the polynomial time algorithm 
developed by Lenstra, Shmoys and Tardos [T7] shows an elegant rounding method to find a schedule 
with makespan 2T. The subsequent generalization by Shmoys and Tardos [2^, shows in fact that even 
with a cost function to map each job to a machine, if a mapping with cost C and makespan T exists, 
then their algorithm finds a schedule with cost C and makespan at most 2T. 

Motivated by the problem of shutting down machines when the demand is low, we define the following 
"machine activation" problem. 

Given a set J of n jobs and a set M of m machines, our goal is to activate a subset S of machines 
and then map each job to an active machine in S, minimizing the overall makespan. Each machine has 



an activation cost of a,,. The activation cost of the subset S is a{S) = Ylies'^i- show that if there 
is a schedule with activation cost A and makespan T, then we can find a schedule with activation cost 
2(l + i)(ln jjPj^ + 1)^4 and makespan (2 + e)r for any e > by the LP-rounding scheme (we call this is 
a ((2 + e), 2(1 + i)(ln + l))-approximation). We also present a greedy algorithm which gives us a 
(2, 1 + In n)-approximation. Actually, the Inn term in the activation cost with this general formulation 
is unavoidable, since this problem is at least as hard to approximate as the set cover proble for which 
a (1 - e)lnn approximation algorithm will imply that NP C DTIME{n'^^^°^^°s'^'^) [5]. 

We also show that the recent PTAS developed by Epstein and Sgall [7] can be extended to the 
framework of machine activation problems for the case of scheduling jobs on uniformly related parallel 
machines. (The original PTAS by Hochbaum and Shmoys [12] is slightly more complicated than the 
method suggested by Epstein and Sgall [7j.) 

We also consider a version of the problem in which a subset of the jobs may be dropped to save 
energy (recall Hamilton's point(l)). In this version of the problem, each job j also has a benefit ttj and 
we need to process a subset of jobs with total benefit of at least 11. Suppose that a schedule exists with 
cost Cn and makespan Tn that obtains a total benefit at least 11. We show that the method due to 
Shmoys and Tardos [20] can be extended to find a collection of jobs to perform with expected benefit 
at least 11 and expected cost Cn, with a makespan guaranteed to be at most 2Tn (see Appendix!^ . 
(The recent work by Gupta et al. [TT] gives a clever deterministic scheme with makespan 3Tn and cost 
(1 + e)Cn along with several other results on scheduling with outliers. This has been further improved 
to (2 + e)Tn and cost (1 + e)Cn in [IE].) 

1.1 Related Work on Scheduling Generalizations of the work by Shmoys and Tardos |20| . have 
considered the Lp norm. Azar and Epstein [2] give a 2- approximation for any Lp norm for any p > 1, 
and a v^-approximation for the L2 norm. The bounds for p ^ 2 have been subsequently improved |16j . 

In addition, we can have release times rjj associated with each job - this specifies the earliest time 
when job j can be started on machine i. Koulamas et al. [15] give a heuristic solution to this problem 
on uniformly related machines with a worst case approximation ratio of 0{^/rn). 

Minimizing resource usage has been considered before. In this framework, a collection of jobs J 
needs to be executed ~ each job has a processing time pj, a release time rj and a deadline dj. In the 
continuous setting, a job can be executed on any machine between its release time and its deadline. 
In the discrete setting each job has a set of intervals during which it can be executed. The goal is to 
minimize the number of machines that are required to perform all the jobs. For the continuous case, 
Chuzhoy and Codenotti [1] have recently developed a constant factor approximation, improving upon a 
previous algorithm given by Chuzhoy et al [5] . For the discrete version Chuzhoy and Naor [6] have shown 
an r2(loglogn) hardness of approximation. However this framework does not model non-uniformity of 
machines, which is one of the key issues in data centers. In addition, non-uniformity of activation costs 
is not addressed in their work neither. 

1.2 Related Work on Energy Minimization Augustine, Irani and Swamy [1] develop online 
algorithms to decide when a particular device should transition to a sleep state when multiple sleep 
states are available. Each sleep state has a different power consumption rate and a different transition 
cost. They provide deterministic online algorithms with competitive ratio arbitrarily close to optimal 
to decide in an online way which sleep state to enter when there is an idle period. See also the survey 
by Irani and Pruhs for other related work |14j . 



^This is easy to see — we can view a set cover instance as a bipartite graph connecting elements (jobs) to corresponding sets 
(machines). If the element belongs to a set, then the processing time of the corresponding job on the corresponding machine is 0, 
o.w. it is CO. An optimal set cover solution corresponds to an optimal set of machines to activate with makespan. 



1.3 



Our Contributions Our main contributions are: 



• A randomized rounding method that approximates both activation cost and makespan for 
unrelated parallel machines. This method is based on rounding the LP solution of a certain 
carefully defined LP relaxation and uses ideas from work on dependent rounding [1U\ (Section 
2). 

• Extensions of the above method when we have assignment costs in addition to activation costs as 
part of the objective function (Section 3). 

• A greedy algorithm that approximates both activation cost and makespan for unrelated parallel 
machines and gives a (2, 1 + In n)-approximation (Section 4). 

• Extensions of these results to the case of handling outliers using the methods from [11] as well as 
release times (Section 5). 

• A polynomial time approximation scheme for the cost activation problem for uniformly related 
parallel machines extending the construction given for the version of the problem with no activation 
costs [7] (Section 6). 

• A simple dependent rounding scheme for the partial GAP problem (Appendix 
2 LP Rounding for Machine Activation on Unrelated Machines 

In this section, we first provide a simple roundinging scheme with an approximation ratio of 
(O(logn), O(logn)). Then we improve it to a (2 + e,2(l + i)(ln^jpy + l))-approximation by a new 
rounding scheme. We can formulate the scheduling activation problem as an integer program. We 
define a variable yi for each machine i, which is 1 if the machine is open and 0, if it is closed. For 
every machine-job pair, we have a variable Xij, which is 1, if job j is assigned to machine i and is 0, 
otherwise. In the corresponding linear programming relaxation, we relax the yi and Xij variables to be 
in [0, 1]. The first set of constraints require that each job is assigned to some machine. The second set 
of constraints restrict the jobs to be assigned to only active machines, and the third set of constraints 
limit the workload on a machine. We require that 1 > Xij,yj > and if pij > T then Xij = 0. The 
formulation is as shown below: 

m 

min^Oi?/j (2.1) 

i=l 

S.t. Xij = 1 \/j £ J 

Xi,j <yi Vi G M,j £ J 
i,j < Tyi yi 

j 

Suppose an integral solution with activation cost A and makespan T exist. The LP relaxation will 
have cost at most A with the correct choice of T. All the bounds we show are with respect to these 
terms. In Section 2.2 we show that unless we relax the makespan constraint, there is a large integrality 
gap for this formulation. 



2.1 Simple Rounding We first start with a simple rounding scheme. Let us denote the optimum 
LP solution by y, x. The rounding consists of the following four steps: 

1. Round each i/j to 1, with probability yi and with probability 1 — yi- If 2/i is rounded to 1, open 
machine i. 

2. For each open machine i, consider the set of jobs j, that have fractional assignment > on machine 
i. For each such job, set Xij = If ^jPijXi j < T, (it is always < T) then uniformly increase 
Xij. Stop increasing any Xij that reaches 1. Stop the process, when either the total fractional 
makespan is T or all ^ij's are 1. If Xij = 1, assign job j to machine i. If machine i has no job 
fractionally assigned to it, drop machine i from further consideration. For each job j that has 
fractional assignment Xij, assign it to machine i with probability Xij. 

3. Discard all assigned jobs. If there are some unassigned jobs, repeat the procedure. 

4. If some job is assigned to multiple machine, choose any one of them arbitrarily. 

In the above rounding scheme, we use y^'s as probabilities for opening machines and for each opened 
machine, we assign jobs following the probability distribution given by Xij^s. It is obvious that the 
expected activation cost of machines in each iteration is exactly the cost of the fractional solution given 
by the LP. The following lemmas bound the number of iterations and the final load on each machine. 

Lemma 2.1. The number of iterations required by the rounding algorithm is O(logn). 

Proof. Consider a job j. In a single iteration, Pr( job j is not assigned to machine i ) < {l — yi) + yi{l — 
^) = 1 - Xij. Hence, 

Pr( job j is not assigned in an iteration ) 

< Uii - X,,) < (1 - -r < - 

me 

i 

The second inequality holds since Xij = 1 and the quantity is maximized when all s are equal. 
Then, it is easy to see the probability that job j is not assigned after 2 Inn iterations is at most 
Therefore, by union bound, with probability at least \ — all jobs can be assigned in 2 Inn iterations. 

□ 

Lemma 2.2. The load on any machine is 0(Tlogn) with high probability. 

Proof. Consider any iteration h. Denote the value oi X^j at iteration h, by X-^y For each open machine 
i and each job j, define a random variable 

, if iob 7 is assigned to machine i 
T , J J B ^2.2) 

, otherwise 

Clearly, < Zij^h < 1- Define, Zi = Y^j^h^iJA- Clearly, 

E[^.] = 5^L^^^!:^<5;i<e(logn) 



Denote by Mi the load on machine i. Therefore, Mj = TZj, thus E[Mj] < G(nogn). Now by the 
standard Chernoff-HoefFding bound [131 [E], we get the result. □ 



2.2 Integrality Gap of the Natural LP, for Strict Makespan Let there be m jobs and m 
machines. Call these machines Ai, A2, Am-i, and B. Processing time for all jobs on machines 

1 is T and on i? it is ^. Activation costs of 1, and 

for B it is very high compared to m, say R{R >> m). An integral optimum solution has to open 
machine B with total cost at least R. 

Now consider a fractional solution, where all machines Ai,A2, ..,Am-i are fully open, but machine 
B is open only to the extent of 1/m. All jobs are assigned to the extent of 1/m on each machine 
Ai,A2, ..,Am-i- So the total processing time on any machine Ai is = T. The remaining ^ part of 
each job is assigned to B. So total processing time on B is ' = m- ^^^^ optimal 
fractional cost is at most m + ^ (by setting ys = Therefore, the integrality gap is at least ~ m. 

2.3 Main Rounding Algorithm for Minimizing Scheduling Activation Cost with Makespan 
Budget In this section, we describe our main rounding approach, that achieves an approximation factor 
of 2(1+ i) (In + for activation cost and (2 + e) for makespan. Based on this new rounding scheme, 
we show in Section [3] how to simultaneously approximate both machine activation and job assignment 
cost along with makespan, and how to extend it to handle outliers, when some jobs can be dropped 
(Section [S]). For the basic problem with only activation cost and makespan, we show in Section 4, that 
a greedy algorithm achieves an approximation factor of (2, 1 + Inn). However, the greedy algorithm is 
significantly slower than the LP rounding algorithm, since it requires computations of (m — i) linear 
programs at the ith step of greedy choice, where i can run from 1 to min{m, n) and m, n are the number 
of machines and jobs respectively. 

The algorithm begins by solving LP (Eq (j2.ip ). As before x, y denote the optimum fractional solution 
of the LP. Let M denote the set of machines and J denote the set of jobs. Let |M| = m and \J\ = n. 
We define a bipartite graph G = (M U J, E) as follows: M D J are the vertices of G and e = G E, 
if Xij > 0. The weight on edge is Xij and the weight on machine node i is yi. Rounding consists 
of several iterations. Initialize X = 5c and Y = y. The algorithm iteratively modifies X and Y, such 
that at the end X and Y become integral. Random variables at the end of iteration h are denoted by 
X^j and yA. 

The three main steps of rounding are as follow: 

1. Transforming the Solution: It consists of creating two graphs Gi and G2 from G, where Gi has 
an almost forest structure and in G2 the weight of an edge and the weight of the incident machine 
node is very close. In this step, only Xij's are modified, while Yi's remain fixed at y^'s. 

2. Cycle Breaking: It breaks the remaining cycles of Gi and convert it into a forest, by moving certain 
edges to G2. 

3. Exploiting the properties of Gi and G2, and rounding on Gi and G2 separately. 
We now describe each of these steps in detail. 

2.4 Transforming the Solution We decompose G into two graphs Gi and G2 through several 
rounds. Initially V{Gi) = V{G) = MU J, E{Gi) = E{G), ^(Ga) = M and ^(Ga) = 0. In each round, 
we either move one job node and/or one edge from Gi to G2 or delete an edge from Gi. Thus we always 
make progress. An edge moved to G2 retains its weight through the rest of the iterations, while the 
weights of the edges in Gi keep on changing. 

We maintain the following invariants, 



(II) V(z,j) G EiGi), and V/i, X^^^^ G (0,2/^/7), P^,J > 0. 



VjGJ', J2 = 1- Yl (2-3) 



ii,j)GEiGi) (ij)^E(G2) 



\/ieM', Yl PiJ^'iJ = Yp^'^^^J- Y Pid'>^i'j (2.4) 
jeJ', j&J' jeJ', 

{i,j)eE{Gi) {i,j)eE{G2) 

Figure 1: Linear System at the beginning of iteration (h + 1) 

(12) Vi G M and V/i, ^ ■ X^^^-p,,, < Ty^. 

(13) V(i, j) G ^(G2) and V/i, 1 > > y^/j. 

(14) Once a variable is rounded to or 1, it is never changed. 

Consider round one. Remove any machine node that has Yj^ = from both Gi and G2. Activate 
any machine that has Y^^ = 1. Similarly, discard any edge (z,j) with Xlj = 0, and if X^j = 1, assign 
job j to machine i and remove j. If Xf j > yi/j, then remove the edge (i, j) from Gi and add the job j 
(if not added yet) and the edge (i,j) with weight Xij{> yi/^) to G2. Note that, if for some («,j) G G, 
Pi,j = 0, then we can simply take Xij = yi and move the edge to G2. Thus we can always assume for 
every edge G Gi, pij > 0. It is easy to see that, after iteration one, all the invariants (11-14) are 
maintained. 

Let us consider iteration {h + 1) and let J',M' denote the set of jobs and machine nodes in Gi with 
degree at least 1 at the beginning of the iteration. Note that Y-^ = Y^=yi for all h. Let \M'\ = m' and 
\J'\= n' . As in iteration one, any edge with X^- = in Gi is discarded and any edge with X^- > yi/'^ 
is moved to G2 (if node j does not belong to G2, add it to G2 also). We denote by Wij the weight of 
an edge (i, j) G G2. Any edge and its weight moved to G2 will not be changed further. Since wij is 
fixed when («, j) is inserted to G2, we can treated it as a constant thereafter. Consider the linear system 
(Ax = b) as in Figure [TJ 

We call the fractional solution x canonical, if Xjj G {0,yi/^), for all Clearly {Xi^j}, for 

(i,j) G E{Gi) is a canonical feasible solution for the linear system in Figured! Now, if a linear system 
is under-determined, we can efficiently find a non-zero vector r, with Ar = 0. Since x is canonical, 
we can also efficiently identify strictly positive reals, a and /3, such that for all {i,j),Xij + arij and 
Xij — Prij lie in [0, yi/j] and there exists at least one such that one of the two entries, Xij + otVij 

and Xij — Prij, is in {0, yi/'j}- We now define the basic randomized rounding step, RandStep(A, x, b) : 
with probability -^^^ , return the vector x -|- ar and with complementary probability of -^^^ , return the 
vector X — /5r. 

If X = RandStep(A, X, b), then the returned solution has the following properties |16| : 

Pr(AX = b) = l (2.5) 

E [Xij] = Xi,j (2.6) 

If the linear system in Figure[T]is under-determined, then we apply RandStep to obtain the updated 
vector X'^"''^. If for some X^j'^ = 0, then we remove that edge (variable) from Gi. If X^^^^ = Vi/l, 

then we remove the edge from Gi and add it with weight yi/'j to G2. Thus the invariants (II, 13 and 
14) are maintained. Since the weight of any edge in G2 is never changed and load constraints on all 
machine nodes belong to the linear system, we get from |16j . 



Lemma 2.3. For all i,j,h,u, E 



yh+l 



Xi j. Also for each 



= u. In particular, E 

machine i and iteration h, J2j -^ijPi,j — J2j^i,jPi,j probability 1. 

Thus the invariant (12) is maintained as well. 

If the linear system (Figure [1]) becomes determined, then this step ends and we proceed to the next 
step of "Cycle Breaking". 

2.5 Cycle Breaking: Let M' and N' be the machine and job nodes respectively in Gi, when the 
previous step ended. If \M'\ = m' and |A^'| = n', then the number of edges in Gi is |£'(Gi)| < m' + n' . 
Otherwise, the linear system (Figure [1]) remains underdetermined. Actually, in each connected 
component of Gi, the number of edges is at most the number of vertices due to the same reason. 
Therefore, each component of Gi can contain at most one cycle. 

If there is no cycle in Gi, we are done; else there is at most one cycle, say G = {vo,vi,V2, . . . ,Vk = vq), 
with vq = Vk € M, in each connected component of Gi. Note that since Gi is bipartite, G always has 
even length. For simplicity of notation, let the current X value on edge et = {vt-i.,vt) be denoted 
by Zt. Note that if vt is a machine node, then Zt S {0,yvt/7)j slse vt-i is a machine node and 
Zf £ {0,yvt-i/l)- We next choose values fii, fi2, ■ ■ ■ , fJ'k deterministically, and update the X value of 
each edge et = {vt-i,vt) to Zt + fj-t- Suppose that we initialized some value for fii, and have chosen the 
increments fii, fi2, ■ ■ ■ , fJ-t, for some t > 1. Then, the value nt+i (corresponding to edge et+i = {vt, vt+i)) 
is determined as follows: 

(PI) If Vt J (i.e., is a job node), then fit+i = —l^t (i-e., we retain the total assignment value of wt)\ 

(P2) If Vt G M (i.e., is a machine node), we set ^it+i in such a way so that the load on machine vt 
remains unchanged, i.e., we set ^t+i = —Pvt,vt-il^t/Pvt,vt+n which ensures that the incremental 
load Pvt,vt-ilJ't + Pvt,vt+if^t+i is zero. Since Pvt,vt+i is non-zero by the property of Gi therefore, 
dividing by Pvt,vt+i is admissible. 

The vector fi = (^i,/i2, ■ • • ,/^fe) is completely determined by ^i, for the cycle G. Therefore, we can 
denote this fi by /(/^i). 

Let a be the smallest positive value, such that if we set /ii = a, then for all Xij values (after 
incrementing by the vector fi as mentioned above stay in [0,^^/7], and at least one of them becomes 
or yi/7. Similarly let /? be the smallest positive value such that if we set fj-i = —j3, then again all Xij 
values after increments lie in [0, yi/7] and at least one of them is rounded to or 2/4/7. (It is easy to see 
that a and /? always exist and they are strictly positive.) We now choose the vector /x as follows: 

(Rl) Set 11 = f{a), iipvo,vi - Pvo,Vk-il^k/ IJ^i < 0. 

(R2) Set n = f{-f3), if Pvo,vi - Pvo^v^^^^l^k/ P-i > 0. 

If some Xij is rounded to 0, we remove that edge from Gi. If some edge Xij becomes yj/7, then 
we remove it from Gi and add it to G2, with weight yi/j. Since at least one of these occurs, we are 
able to break the cycle. 

Let (j) denote the fractional assignment of x variables at the beginning of the cycle breaking phase. 
Then clearly, after this step, for all jobs j, considering both Gi and G2, — Si 4>i,j- 

For any machine i £ M, if i ^ G, then clearly PijXij = YljPi,o't>i,j- If ^ £ C*, but i 7^ uq, then 
by property (P2), before inserting any edge to G2, we have ^jPijXij = Yl,jPi,j^i,j- edge added 
to G2 after the cycle breaking step has the same weight as it had in Gi. Therefore, we have, for any 
i / Wo, and considering both Gi and G2, J2j Pi,j-^i,j — YlijPi,j4>i,j- Now consider the machine fo(= Vk). 
Its change in load is exactly /ii(puo,fi ~ P?;o,«fc-iMfc//^i)- Therefore by the choice of (Rl) and (R2), the 
load on machine vq can only decrease. Hence, by property (j2.5p . we have the following lemma. 



Lemma 2.4. Considering both Gi and G2, we have after the cycle breaking step with probability 1; 
Ej = 1 ^i; Ej ^i,jPi,j < Tyi Vz; , Xij < yi \/iJ. 

2.6 Rounding on Gi and G2 The previous two steps ensures, that Gi is a forest and in G2, 
Xij > yi/^, for all {i,j) G E{G2)- We remove any isolated nodes from Gi and G2, an round them 
separately. 

2.6.1 Further Relaxing the Solution Let us denote the job and the machine nodes in Gi {G2) 
by J(Gi) (or J(G2)) and M(Gi) (or M(G2)) respectively. Consider a job node j £ J{G2). If 
Er(i j)G£;(G2) ^ ^/"^ choose 5 later), we simply remove all the edges (i,j) from G2 and the 
following must hold: Yji:{i,j)(^E{Gi) ^i,3 - ^ ~ 1/^- Otherwise, if I]i:{ij)e£;{G2) - ^Z"^' remove 
all edges («, j) G E{Gi) from Gi. Therefore at the end of this modification, a job node can belong to 
either J{Gi) or J(G2), but not both. If j G J(Gi), we have EieM^ij > 1 " V-^- Else, if j G J(G2), 

For the makespan analysis it will be easier to partition the edges incident on a machine node i into two 
parts " the job nodes incident to it in Gi and in G2. The fractional processing time due to jobs in J{Gi) 
(or J(G2)) will be denoted by T'yi (or T"yi), i.e., T'lji = J2jeJiGi)PiJ^i,j (o^ '^"Vi = EjeJlGa) ^^^J^^j)- 

2.6.2 Rounding on G2: In G2, for any machine node i, recall Eje j{G2) ~ '^"Vi- Since we 
have for all i G M{G2),j G ^(^2), Xij > yi/j, we have EjGj(G2)^'*J — ^"7- Therefore, if we decide 
to open a machine node i G M(G2), then we can assign all the nodes j G J(G2), that have an edge 
{i,j) G E{G2), by paying at most T"-y in the makespan. 

Hence, we only concentrate on opening a machine in G2, and then if the machine is opened, we 
assign it all the jobs incident to it in G2. For each machine i G M(G2), we define Yi = min{l,yj5}. 
Since, for all job nodes j G J(G2), we know EiGM(G2) — ^/^^ after scaling we have for all j G J(G2), 
E(i j)e£;(G2) — Therefore, this exactly forms a fractional set-cover instance, which can be rounded 
using the randomized rounding method developed in [22] to get activation cost within a factor of 
6{logjjpj^ + 1). The instance in G2 thus nicely captures the hard part of the problem, which comes 
from the hardness of approximation of set cover. Thus we have the following lemma. 

Lemma 2.5. Considering only the job nodes in G2, the final load on any machine i G M(G2) is at most 
T"j and the total activation cost is at most (5(log + l)OPT, where T" is the fractional load on 
machine i G M(G2) before rounding on G2 and OPT is the optimum activation cost. 

2.6.3 Rounding on Gi: For rounding in Gi, we traverse each tree in Gi bottom up. If there is a 
job node j, that is a child of a machine node i, then if Xi j < l/rj {rj to be fixed later), we remove the 
edge (i, j) from Gi. Since initially j G J(Gi), EieAf -^ij — ^~ ^/^^ even after these edges are removed, 
we have for j G J(Gi), EieA/(Gi) -^hj ^ 1 ~ l/*^ ~ 1/^- However if Xij > I/t], simply open machine 
i, if it is not already open and add job j to machine i. Initially iji > l/rj, since yi > Xij. The initial 
contribution to cost by machine i was > -aj. Now it becomes Oj. If ^^-^Pij = T', with Xi j > 1/r], 
now it can become at most r]T'. 

After the above modification, the yet to be assigned jobs in J(Gi) form disjoint stars, with the job 
nodes at their centers. Consider each star, Sj with job node j at its center. Let ii,i2, -liij be all the 

machine nodes in 5j, then we have, Efc=i ^ik,3 ^ ^ ~ ^/ ^ ~ ^M- Therefore Efc=i Vik ^ ^ ~ ^/^ ~ ^hi- 
ll there is already some opened machine, ii, assign j to i; by increasing the makespan at most by an 
additive T. Otherwise, open machine i/ with the cheapest ai^. Since the total contribution of these 

machines to the cost is Efe=i Vik'^ik — Efc=i Vik^ii — (1 — l/i^ — l/r/)ai, , we are within a factor 
of the total cost contributed from Gi. 



Hence, we have the following lemma, 



Lemma 2.6. Considering only the job nodes in Gi, the final load on any machine i G M{Gi) is at 
most T'r] + max.ijpij and the total activation cost is at most max(i, jii:ijj^:ij:^)OPT , where T' is the 
fractional load on machine i S M(Gi) before rounding on Gi and OPT is the optimum activation cost. 

Now combining, Lemma 12.41 12.51 and 12.61 and by optimizing the values of (5, r/ and 7, we get the 
following theorem. 

Theorem 2.1. A schedule can be constructed efficiently with machine activation cost 2(1 + ^)(ln q^j. + 
l)OPT and makespan (2 + e)T, where T is the optimum makespan possible for any schedule with 
activation cost OPT. 

Proof. From Lemma |2 . 51 and 12.61 we have, 

• Machine opening cost is at most ^max(i, ^ + ^ (in oPr + ^)) OPT 

• Makespan is at most T(max(7, ry)) +maxjjpjj 

Now r] > 'J, since otherwise any edge with Xij > l/i] will be moved to G2 and 1—1/5 > l/i]. Now set, 
j = r],6 = 1+C, for some C > 0. So 1-1/S = C/(l+C)- Set 1/r] = C/(1+C)-1/(1+C)(ln o^ + l). Thus, 
we have an activation cost at most 2(1 + C)(ln Qp^ + l)OPT and makespan < T ( 1 + ) + max ^ j pij. 

Therefore, if we set ^ = 1 + 2/ In ?i, we get an activation cost bound of 4(ln + 1)0PT and makespan 
< 2T+maxjj pij. In general, by setting e = ^, we get an activation cost at most 2(l+i)(ln -^pj^+l)OPT 
and makespan < (2 + e)T. □ 

3 Minimizing Machine Activation Cost and Assignment Cost 

We now consider the scheduling problem with assignment costs and machine activation costs. As before, 
each job can be scheduled only on one machine, and processing job j on machine i requires pi^j time 
and incurs a cost of Qj-. Each machine is available for T time units and the objective is to minimize 
the total incurred cost. In this version of the machine activation model, we wish to minimize the sum 
of the machine activation and job assignment costs. Our objective now is 

min ^ OiVi + ^ Q jXjj 

subject to the same constraints as the LP defined in Eq ()2.ip . 

Our algorithm for simultaneous minimization of machine activation and assignment cost follows the 
same paradigm as has been developed in Section [2.31 with some problem specific changes. We mention 
the differences here. 

3.1 Transforming the Solution After solving the LP, we obtain, G = j CijXij. Though, we 
have an additional constraint C = X^j CijXij to care about, we do not include it in the linear system 
and proceed exactly as in Subsection 12.41 As long as the system is under determined, we can repeatedly 
apply RandStep to form the two graphs Gi and G2. By Property 12.61 /i,E X^^j = Xij and 

hence, we have that the expected cost is CijXij. The procedure can be directly derandomized by 
the method of conditional expectation giving an 1-approximation to assignment cost. 

When the system becomes determined, we move to the next step. Thus at that point, in every 
component of Gi, the number of edges is at most the number of vertices. Thus again each component 
of Gi, can consist of at most one cycle. In G2, for all (i, j) E E(G2), we have Xij > yi/^. 



3.2 Breaking the Cycles For breaking the cycle in every component of Gi, we proceed in a slightly 
different manner from the previous section. However, we now have two parameters, pi,j and Cij 
associated with each edge. Suppose is an edge in a cycle. 

If the Xiij value of this edge exceeds ^ then we can assign job j to machine i' and increase the 
processing load on the machine by Piij- This increases the makespan at most by an additive ^, since 
the job was already assigned to an extent of ^ on that machine. The assignment cost also goes up, but 
since we pay Cj/j to assign j to i' , and the LP solution pays at least ^Cj/j, this cost causes a penalty 
by a factor of 2 even after summing up all such assignment costs. Similarly, activation cost is also only 
affected by a factor of 2. 

If the Xi'j value is at most ^, then we simply delete the edge [i' ,])■ We scale up all the Xi^j values 
and iji values by 2. Thus the total assignment of any job remains at least 1 and the cost of activation 
and assignment can go up only by a factor of 2. 

3.3 Rounding on Gi, G2 The first part involves further relaxing the solution, that is identical to the 
one described in subsection 12.6.11 Therefore, we now concentrate on rounding Gi and G2 separately. 

3.3.1 Rounding on G2 In G2, since we have for all G E{G2), Xij = yi/'j, if we decide to 
open machine i, all the jobs j G J{G2) can be assigned to i, by losing only a factor of 7 in the 
makespan. Therefore, we just need to concentrate on minimizing the cost of opening machines and the 
total assignment cost, subject to the constraint that all the jobs in J{G2) must have an open machine to 
get assigned. This is exactly the case of non-metric uncapacitated facility location and we can employ 
the rounding approach developed in [21] to obtain an approximation factor of 0(log f^f) + 0(1) on 
the machine activation and assignment costs. 

3.3.2 Rounding on Gi Rounding on Gi is similar to the case when there is no assignment costs 
with a few modifications. We proceed in the same manner and obtain the stars with job nodes at the 
centers. Now for each star Sj, with j at its center, we consider all the machine nodes in Sj. If some 
machine i G Sj is already open, we make its opening cost 0. Now we open the machine, i € Sj, for 
which Cj + aij is minimum. Again using the same reasoning as in Subsection 12.6.31 the total cost does 
not exceed by more than a factor of jzzfjjzrYj:^ ■ 

Now optimizing a, (3, 7, we get the following theorem. 

Theorem 3.1. If there is a schedule with total machine activation and assignment cost as OPT 
and makespan T, then a schedule can be constructed efficiently in polynomial time, with total cost 
0(log + 1)0PT and makespan < (3 + e)T. 

Note that for both the cases of minimizing alone the machine activation cost and also minimizing 
the assignment cost simultaneously, total cost is bounded within a constant factor of log d, where d is 
the maximum degree (total number of edges incident on the bipartite graph) of any machine node in 
G2. 

4 The Greedy Algorithm 

In this section, we present a greedy algorithm that achieves an approximation factor of (2,1 + Inn). 
The algorithm is similar to the standard set cover type greedy algorithm and runs in iterations. In each 
iteration, the most "cost-effective" set, the set that maximizes the ratio of the incremental benefit of 
the set, to its cost, is chosen and added to our solution set, until all elements are covered. 

Given that a solution with activation cost A and makespan T exists, at each step we wish to select 
a machine to activate based on its "cost-effectiveness". Given a set S of active machines, let F{S) 



denote the maximum number of jobs that can be scheduled with makespan T. However, in this case, 
the quantity F{S), is NP-hard to compute, thus it is unhkely to have efficient procedures either to test 
the feasibihty of the current set of active machines or to find the most cost-effective machine to activate. 
The central idea is that instead of using the integral function F(S) that is hard to compute, we use a 
fractional relaxation that is much easier to compute, and allows us to apply the greedy framework. 

Formally, for a value T, we first set all Pij 's that are larger than T to infinity (or the corresponding 
Xjj to 0). Let f{S) be the maximum number of jobs that can be fractionally processed by a set S of 
machines that are allowed to run for time T each. In other words, 

f{S) = max Xjj (4.7) 
s.t. y^ Xij < 1 Vj G J 
y^PijXjj < T \/i £ S 

< Xij < 1 Xij = ii i ^ S or pij > T 

Note that f{S) can be computed by using a general LP solver or by a generalized flow computation. 
The generalized flow problem is the same as the traditional network flow problem except that, for each 
arc e, there is a gain factor 7(e) and for each unit of flow that enters the arc 7(e) units exit. To see 
that / can be computed by a generalized flow computation, we add a sink t to the bipartite graph 
G{M U J, E) and connect each job to t with an arc with capacity 1. Each edge G M,j G J has 

a capacity pij and gain factor 1/pij- Every machine i £ S has a flow excess of T. It is easy to see the 
maximum amount of flow that reaches t is exactly the optimal solution of LP (j4.7p . 

A function z : 2^ ^ R is submodular if z{S) + z{P) > z{S n P) + z{S U P) for any S,P C N. Let 
z{S) be the maximum amount of flow that reach t starting with the excesses at nodes in S: Recently, 
Fleischer [9] proved the following: 

Lemma 4.1. (Fleischer) For any generalized flow instance, z{S) is a submodular function. 

It is a direct consequence that f{S) is submodular. 

Deflne gain{i, S) = f{S U i) — f{i) for any i £ M and S C M. Our greedy algorithm starts with 
an empty set S of active machines, and activates a machine s in each iteration that maximizes ^°^"(^''^) 
until f{S) > n—1. We then round the fractional solution to an integral one using the scheme by Shmoys 
and Tardos [20]. 

Algorithm GREEDY-SCHEDULING 

5 = 0; 

While(/(5) < n - 1) do 

Choose i £ M \ S such that jg maximized; 

S = SU{i}; 
Activate the machines in set S; 

Round f{S) to an integer solution to find an assignment. 



The problem is actually a special case of the submodular set cover problem: mm{^j^g aj \ z{S) = 
z{N), S C A^} where 2; is a nondecreasing submodular function. In fact, Wolsey [23] shows the following 
result about the greedy algorithm, rephrased in our notation. 



Theorem 4.1. (Wolsey) Let St he the solution set we have chosen after iteration t in the greedy 
algorithm. Then, 



where OPT is the optimal solution. 

In particular, if /() is integer- valued, the theorem yields a 1 +lnn approximation. However, /() is 
not necessarily integral in our problem. Therefore, we terminate iterations only when more than n — 1 
(rather than n) fractional jobs are satisfied, thus f{M) — f{St-i) > 1 and Theorem 14.11 gives us a 
(1 + lnn)-approximation for the activation cost. 

Finally, we would like to remark that the rounding step guarantees to find a feasible integral solution 
although the fractional solution we start with only satisfies more than n — 1 jobs. The reason lies in the 
construction by Shmoys and Tardos (refer to [20] for more details). Therefore, there exists an integral 
matching such that all jobs are matched. Moreover, it is also proven that the job assignment induced 
by any integral matching has a makespan at most T + max pij. Therefore, our final makespan is at most 
2T. 

5 Extensions 

5.1 Handling Release Times Suppose each job j has a machine related release time rij, i.e, job j 
can only be processed on machine i after time rij. We can modify the algorithm in Section 2 to handle 
release times as follows. 

For any "guess" of the makespan T, we let Xij = if r^ + pij > T in the LP formulation. Then, 
we run the ((2 + e),2(l + i)(ln 

(JPT + l))-approximation regardless of the release times and obtain a 
subset of active machines and an assignment of jobs to these machines. Suppose the subset Jj of jobs is 
assigned to machine i. We can now schedule the jobs in Jj on machine i in order by release time. It is 
not hard to see the makespan of machine i is at most T + Xljg j, Pij since every job can be scheduled on 
machine i after time T. Therefore, we get a (3 + e, 2(1 + 7) (log Qp^p + 0(1))) approximation. Similar 
extensions can be done for the case with activation and assignment costs. 

5.2 Scheduling with Outliers We now consider the case where each job j has profit irj and we 
are not required to schedule all the jobs. Some jobs can be dropped but the total profit that can be 
dropped is at most 11'. Therefore the total profit earned must be at least ttj — 11' = 11. We now 
show how using our framework and a clever trick used in [TT], we can obtain a bound of (3 + e) on the 
makespan and 2(1 + i)(ln + 1) on the machine activation cost, while guaranteeing that profit of at 
most n'(l + e) is not scheduled. If we consider both machine activation and assignment cost, then we 
obtain a total cost within 0(log + 0(1)) of the optimum without altering the makespan and the 
profit approximation factor. 

We create a dummy machine dum, which has cost Odum = and for all j, Cjj = 0. Processing time 
of job j on dum is ttj. It is a trivial exercise to show that both the algorithms of the previous sections 
work when the makespan constraint is different on different machines. If the makespan constraint on 
machine i is Tj, then we the makespan for machine i is at most (1 + e)Tj + maxjPij. For the dummy 
machine dum, we set a makespan constraint of 11'. Since after the final assignment the makespan at the 
dummy node can be at most (1 + e)n' + maxj ttj. With some work it can be shown that we can regain 
the lost profit for a job with maximum profit on dum, to either an existing machine or by opening a 
new machine. This either increases our cost slightly, or increases the makespan to at most (3 + e)T. 




6 Minimizing Machine Activation Cost in Uniformly Related Machines 

In this section, we show that for related parallel machines, there is an polynomial time (1 + e, 1)- 
approximation for any e > 0. If a schedule with activation cost A and makespan T exists, then we find 
a schedule with activation cost A and makespan at most (1 + e)T. 

We briefly sketch the algorithm which is a slight generalization of the approximation scheme for 
makespan minimization on related parallel machines by Epstein and Sgall [7] . Actually, their algorithm 
can optimize a class of objective functions which includes for example makespan, Lp norm of the load 
vector etc. We only discuss the makespan objective in our paper. The extensions to other objectives 
are straightforward. 

Roughly speaking, Epstein and Sgall's algorithm works as follows (see [7] for detailed definitions 
and proofs) . They define the notion of a principal configuration which is a vector of constant dimension 
and is used to succinctly represent a set of jobs (after rounding their sizes). A principal configuration 
(see Appendix |B] for more details) is of the form {w, n) where w = Q or w = 2'' for some integer i and 
n is a vector of non-negative integers. The number of different principal configurations is polynomially 
bounded (for any fixed e > 0). They also construct the graph of configurations in which each vertex is 
of the form [i,a{A)) for any 1 < i < m and principal configuration a{A) of the job set A d J. There 
is a directed edge from (i — l,a) to («,«') if a' represents a set of jobs that is a superset of what a 
represents and its length is the (1 + e)-approximated ratio of the weights of the jobs in the difference 
of these two sets to the speed Si of machine i. Intuitively, an assignment Ji, . . . , with jobs in Jj 
assigned to machine i corresponds to a path P = {(i, oci)}i in G such that represents U*-^;^ and the 
length of edge ((i — 1, a^-i), («, on)) is approximately the load of machine i. By computing a path P in 
G from (0, a(0)) to (m, a( J)) such that the maximum length of any edge in P is minimized, we can find 
an 1 + e approximation for minimizing the makespan. 

To obtain a (1 + e, l)-approximation of the machine activation problem, we slightly modify the 
above construction of the graph as follows. The sets of vertices and edges are the same as before. We 
associate each edge with a cost. If both endpoints of edge ((i — 1, ai-i), (i, a^)) have the same principal 
configuration ai_i = on , then the cost of the edge is 0; Otherwise, the cost is the activation cost Oj of 
machine i. For the guess of the makespan T*, we compute a path from (0, a(0)) to (m, Oi{J)) such that 
the maximum length of any edge in P is at most and the cost is minimized. If T < (1 + e)r*, we 
are guaranteed to find a path of cost at most A. 

7 Conclusions 

Current research includes considering different Lp norms as well as other measures such as weighted 
completion time. The greedy approach currently only works for the most basic version giving a makespan 
of 2T and an activation cost of 0{\ogn)A . Extending it to handle other generalizations of the basic 
problem is ongoing research. 

Acknowledgments: We thank Leana Golubchik (USC) and Shankar Ramaswamy (Amazon) for useful 
discussions. 
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Appendix 

A Partial GAP (No activation costs) 

Suppose each job earns a profit of ttj. There are n jobs and m machines. We wish to schedule a subset 
Sj of jobs of total profit at least H. Job j has a processing time of pij if it is assigned to machine i and 
has an assignment cost of Cij. We show that if an assignment exists for a subset of jobs Sj with the 
property that TT{Sj) > IT, such that this assignment has cost C and makespan T, then in polynomial 
time we can find an assignment with expected cost C and expected profit 11 with makespan at most 2T. 

The idea is extremely simple. We first solve the following LP relaxation. We have an integer 
variable yi which is 1 if and only if job i is scheduled. The first constraint states that the total profit of 
scheduled jobs is at least 1. The second constraint ensures that all jobs that are scheduled are assigned 
to a machine. The third constraint ensures that the total cost is not too high. 



J2 T^jVj > n 

j€J,ieM 

X J <T\/i£ M 
< Xij < 1,0 < < 1 

The high level idea is as follows: suppose we have a fractional solution satisfying the above constraints 
(as in [20], if pij > T then we set Xjj = 0). We create a bipartite graph as follows - let G = (J, P, E) 
be a bipartite graph where J is the set of job nodes (one vertex for each job) associated with a yj value 
(the extent to which this job is done). For each machine node i in M, let Yljej^i,] — ■^i- We create 
Pi = \Zi\ nodes corresponding to each machine i. Set P = {{i,k)\ii € M,\/k = l...Pi}. For each 
machine node i, we order the jobs assigned to it by the fractional solution in non-increasing pij order 
such that the fractional load on each copy, except for the last copy, is exactly 1. The main insight here 
is that this lets us essentially ignore the processing times of jobs, as long as we can map this solution 
to an integral assignment in which the set of jobs assigned to a particular machine are the set of jobs 
that are matched to the copies of i in P. This part is almost identical to the construction in [20] . 

From this fractional solution we can compute an integral solution by using dependent rounding on 
bipartite graphs [lO] to convert the fractional solution to an integral solution. Each edge is associated 
with a value Xij defined by the solution to the linear program. In addition the fractional degree of each 
job node is exactly y,. The randomized rounding converts each Xij to Xij (an integral value), such 
that Pr[Xij = 1] = Xij. In addition, each node has degree exactly or 1, such that a job node j node 
is matched with probability exactly yi. These properties ensure that the expected cost is C, and the 
expected benefit is at least Yljej "^jUj — '^^^ proof that the makespan is at most 2T is the same as 
the proof given in [20j . 

Derandomizing this method achieving the cost and benefit bounds would be quite interesting. If all 
the TTj values are identical, then instead of using dependent rounding, one can use a direct conversion 
of the fractional matching to an integral matching, maintaining the benefit value and cost values. 

B More Details of the Construction [7] 

Let ^ C J be a set of jobs. Suppose u) is or 2* for some integer i (possibly negative). Let the relative 
rounding precision be 5 > and A be such that A = 1/(5 is an even integer. Given A and w, define 
A{w) = {j e A\pj < 6w}. 

Definition B.l. 1. The rounding function r(p) : Let w be the largest power of two such that 
p > 6w and i be the smallest integer such that p < id'^w. r{p) = iS'^w. It is easy to see 
Pj < r{pj) < {1 + S)pj. 

2. A configuration is of the form {w,n) where n = {n\,n\+i, . . . ,nx2} is a vector of non- 
negative integers. A configuration {w,fi) represents A if (i) pj < w for all j G A; (2) 
for X < i < A^, Ui equals the number of jobs j ^ A with r{pj) = id'^w; (3) n\ G 
{[T.,eAi^)r{p,)/{5w)l\Y.,^Ai^)r{p,)/{5w)^}. 



3. The principal configuration a{A) of A is a configuration {w, n) with the smallest w which represents 
A and ux = EjeAM • 

^. The scaled configuration for {w,n) and w' > w is defined as a vector scale^_»^/(n) = n' such that 
{w' , ft') represents the set K containing exactly ni jobs with processing time id'^w for i = X, . . . , A^, 
and no other jobs. Choose the configuration with — n'^5w'\ < 6w'/2, breaking 

ties arbitrarily. 

Intuitively, a single principal configuration succinctly represent many different sets of jobs that 
are approximately equivalent. It is also not hard to see the number of principal configurations is 
polynomially bounded for any fixed 6. 

The following definition describes the construction of the configuration graph G. The construction 
is the same as in [7], except that we have two metrics on edges, length and cost, which are used to 
capture respectively the makespan and machine opening cost. 

Definition B.2. Assume that the machines are numbered in non- decreasing speed order. The 
configuration graph G: each vertex is of the form {i,a{A)) for any 1 < i < m where a{A) is the principal 
configuration of the job set A C J. The source is (0, a(0)) and the sink is (m, a(J)). There is a directed 
edge from {i—l,{w,n)) to {i,{w',n')) iff either [w , n) = {w',fi') or ft" < ft' and'^^^^{n'-—n'-)6'^w' > w'/3 

where n" = scale^_»^/(n). The length of the edge is i'^i=xi''^i ~ n'-)5'^w')/ Si. The cost of the edge is 
if {w,n) = [w' ,n') and the opening cost ai of machine i otherwise. 

The following definition is essential for establishing the relation between a path of the configuration 
graph and an job assignment. 

Definition B.3. Let Ji, . . . , Jm be a schedule assigning jobs in Ji to machine Mi. A sequence 
{i, {wi,ni)}'^f) of vertices of the graph G represents (is a principal configuration of) the assignment 
if {wi,ni) represent (is a principal configuration of) U*,^-^Jj/. 

Lemma B.l. Let {i — l,(w,n)) be a configuration representing A C J , and {{i — 1, {w,n), — {w' ,n')))) 
be an edge in G. We can find in linear time a set of jobs B such that A C B and {w' ,n') represents B. 

Lemma B.2. 1. Let {Ji} be an assignment. Then its principal representation {(i, (wi,nj)} is a path 
in G. 

2. Let {i, {wi,ni)}^Q be a path in G representing an assignment {Ji}. Let be the maximum length 
of any edge in P and T be the makespan of the assignment. Then \T — T*| < 5T . 

For the guess of the makespan T*, we compute a path from (0, a(0)) to (m,a(J)) such that the 
maximum length of any edge in P is at most and the cost is minimized. By Lemma IB. 11 we can 
efficiently construct an assignment represented by this path. Let {J*} be the assignment with makespan 
T* and cost A* . From Lemma IB. 21 we know there is path of cost A* and the maximum edge length at 
most (1 + 6)T* . Hence, if our guess T# > (1 + 5)T* , we can guarantee to find a path of cost at most 
A* . Again by Lemma IB. 2( 2). we know the makespan of the assignment represented by the path is at 

most T*/{l-8) < (i^)t*. 



